Growing pains for global monitoring of societal events.

نویسندگان

  • Wei Wang
  • Ryan Kennedy
  • David Lazer
  • Naren Ramakrishnan
چکیده

T here have been serious efforts over the past 40 years to use newspaper articles to create global-scale databases of events occurring in every corner of the world, to help understand and shape responses to global problems. Although most have been limited by the technology of the time (1) [see supplementary materials (SM)], two recent groundbreaking projects to provide global, real-time “event data” that take advantage of automated coding from news media have gained widespread recognition: International Crisis Early Warning System (ICEWS), maintained by Lockheed Martin, and Global Data on Events Language and Tone (GDELT), developed and maintained by Kalev Leetaru at Georgetown University (2, 3). The scale of these programs is unprecedented, and their promise has been reflected in the attention they have received from scholars, media, and governments. However, they suffer from major issues with respect to reliability and validity. Opportunities exist to use new methods and to develop an infrastructure that will yield robust and reliable “big data” to study global events— from conflict to ecological change (3). Automated event coding parses individual sentences into SUBJECT VERB OBJECT format and categorizes the action using a framework like CAMEO (Conflict and Mediation Event Observations). So a statement like “Secretary of State John Kerry complained about Russia’s support of Syria’s Bashar al-Assad” would be coded as US GOVERNMENT/DISAPPROVE/RUSSIAN GOVERNMENT. This can be refined into a numeric level of hostility or cooperation by using scales like the Goldstein Score. Whereas CAMEO focuses on categories for international and domestic conflict, similar frameworks could be developed for almost any kind of interaction in news media (e.g., transactions between businesses or debates over scientific findings). Uses for the resulting data have been manifold. Hand-coded and automated event data have been used to anticipate conflict escalation (2). When combined with statistical and agent-based models, ICEWS claims a forecasting accuracy of 80%. GDELT has been used to track, e.g., wildlife crime and the rise of hate speech following the U.K. Brexit vote. There are several challenges in the current approach. First, the focus on sentences removes a great deal of context. Event occurrences do not neatly partition into sentences. This lack of context, for example, often fails

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Power and Priorities: The Growing Pains of Global Health; Comment on “Knowledge, Moral Claims and the Exercise of Power in Global Health”

Shiffman has argued that some actors have a great deal of power in global health, and that more reflection is needed on whether such forms of power are legitimate. Global health is a new and evolving field that builds upon the historical fields of public and international health, but is more multi-disciplinary and inter-disciplinary in nature. This article argues that the distribution of power ...

متن کامل

How the Spectre of Societal Homogeneity Undermines Equitable Healthcare for Refugees; Comment on “Defining and Acting on Global Health: The Case of Japan and the Refugee Crisis”

Recourse to a purported ideal of societal homogeneity has become common in the context of the refugee reception crisis – not only in Japan, as Leppold et al report, but also throughout Europe. Calls for societal homogeneity in Europe originate from populist movements as well as from some governments. Often, they go along with reduced social support for refugees and asylum seekers, for example i...

متن کامل

The Untapped Power of Soda Taxes: Incentivizing Consumers, Generating Revenue, and Altering Corporate Behavior

Globally, soda taxes are gaining momentum as powerful interventions to discourage sugar consumption and thereby reduce the growing burden of obesity and non-communicable diseases (NCDs). Evidence from early adopters including Mexico and Berkeley, California, confirms that soda taxes can disincentivize consumption through price increases and raise revenue to support government programs. The Unit...

متن کامل

Restless legs syndrome (Willis-Ekbom disease) and growing pains: are they the same thing? A side-by-side comparison of the diagnostic criteria for both and recommendations for future research.

There has been no previous side-by-side comparison of the diagnostic criteria for restless legs syndrome (RLS) (Willis-Ekbom disease) and growing pains. In our review, we explore this comparison emphasizing overlaps and disconnects, summarize recent literature exploring the relationship between the 2 entities, and make suggestions for future research. There is considerable overlap in the diagno...

متن کامل

Analysis of events of dust using satellite monitoring and synoptic analysis in southwest Iran

Extended abstract 1- Introduction Dust storms are a kind of severe natural disaster indust source regions, which have a negative impact on human health, industrial products and activities. Iran is a dry  and low water country, the coincidence of this situation and its position in the global rebound belt has brought about very bad conditions. Repeaters in recent years have been affected by the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Science

دوره 353 6307  شماره 

صفحات  -

تاریخ انتشار 2016